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We obtain two theorems extending the use of a saddlepoint ap- 
proximation to multiparameter problems for likelihood ratio-like statis- 
tics which allow their use in permutation and rank tests and could be 
used in bootstrap approximations. In the first, we show that in some 
cases when no density exists, the integral of the formal saddlepoint 
density over the set corresponding to large values of the likelihood 
ratio-like statistic approximates the true probability with relative er- 
pIZh , ror of order 1/n. In the second, we give multivariate generalizations 

(— j ■ of the Lugannani-Rice and Barndorff-Nielsen or r* formulas for the 

approximations. These theorems are applied to obtain permutation 
tests based on the likelihood ratio-like statistics for the k sample and 
the multivariate two-sample cases. Numerical examples are given to 
illustrate the high degree of accuracy, and these statistics are com- 

^- , pared to the classical statistics in both cases. 

^D 

1. Introduction. In parametric problems where distributions are spec- 
if) , ified exactly, the likelihood ratio is generally used for hypothesis testing 
whenever possible. In multiparameter problems, the distribution of twice 
the log likelihood ratio is approximated by a chi-squared distribution. Re- 
finements of this approximation were obtained by Barndorff-Nielsen [2] for 
parametric problems. In a nonparametric setting the empirical exponential 
likelihood is described in Chapter 10 of [6] and discussed in a number of 
references cited there. Saddlepoint approximations for empirical exponen- 
5JJ . tial likelihood statistics based on multiparameter M-estimates are given, for 
example, in [12] and for tests of means in [10], under the strong assumption 
that the density of the M-estimate exists and has a saddlepoint approxima- 
tion. They used methods based on those of [3] to obtain an approximation 
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analogous to the Lugananni-Rice approximation for the one-dimensional 
case. 

It is the purpose of this paper to show that, under conditions which will 
allow the application of the approximations in bootstrap, permutation and 
rank statistics used for multiparameter cases, the integral of the formal 
saddlepoint density approximation can be used to give an approximation 
with relative error of order n" 1 to the tail probability of a likelihood ratio-like 
statistic. This then permits the approximation to be put in the Lugananni- 
Rice form as in [12] and also in a form analogous to the r* or Barndorff- 
Nielsen form given in [2] and [8] for the one-dimensional case. These results 
are then applied to two multiparameter nonparametric cases. We require 
the existence of a moment generating function. This may be too strong 
an assumption in the case of tests concerning means considered here, but 
robust versions of these, as in [12], can be used to make the results widely 
applicable. 

The statistic used is obtained by using the conjugate distribution ap- 
proach of [5] and is the log likelihood ratio in the parametric case of ex- 
ponential families. It can be written as a convex function of X, the mean 
of n independent random variables. This statistic can be approximated to 
first order by a quadratic form in the means X. However, it does not seem 
to be possible to approximate tail probabilities for quadratic forms with 
relative errors of order n _1 , as are obtained for our statistic. Cramer large 
deviation results for the case of quadratic forms in multivariate means were 
obtained by [9] and a number of earlier authors cited in that paper, but 
the relative errors for the approximation to the probability of the statis- 
tic, a random variable of order 1/n, exceeding A is of order \fn\n ' . So 
the relative error is at best of order n" 1 ' 4 . The same problem arises in the 
case of an empirical likelihood statistic, where we know of no saddlepoint 
approximation. 

In the next section we introduce the notation and assumptions necessary 
to obtain the likelihood ratio-like statistic, tail probabilities of which can 
be used for hypothesis testing in multivariate nonparametric settings. We 
reduce certain conditional cases given lattice variables to a more convenient 
notation and state the main result in a theorem showing that tail proba- 
bilities for the statistic can be approximated, to relative order n _1 , by an 
integral of a formal saddlepoint density. We then state and prove a theorem 
giving the integrals in forms like those of Lugananni-Rice and Barndorff- 
Nielsen in the one-dimensional case. In Section 3 we consider two examples 
of permutation tests, for the fc-sample problem and for a two sample multi- 
variate permutation test, using the results of the previous section to obtain 
explicit formulas for test statistics and for the approximations of the tail 
probabilities of these statistics under permutations. We then present numer- 
ical examples illustrating the accuracy of the approximations and comparing 
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results to those obtained using the standard sum of squares test statistics 
for the fc-sample permutation and rank tests and the Mahalanobis D 2 test 
for the 2-sample multivariate test. In the final section we give the proof of 
the main result. 

2. Notation and main result. For a sample of size n with mean vec- 
tor x from a parametric canonical exponential family with density f T (x) = 
exp(r x— k(t))p(x), the maximum likelihood estimate of r is r, the solu- 
tion of k'(t) = x, and, taking fi/(0) = 0, the log likelihood ratio statistic is 
A(x) = r T x — k(t). This is used to test the hypothesis that r = 0, or equiv- 
alently, that k'(t) = 0. For the nonparametric case an empirical exponential 
family is taken, and it is shown, for example, in [12], page 1163, that the em- 
pirical exponential likelihood ratio statistic for a test that the expectation is 
zero is A(x) = -(3jx + n n (f3 ), where K n (/3) = logEr=i ex P(/ ?Tx «)]/ n and A) 
is the solution of n' n ([3) = 0. In [12] a bootstrap approximation can be based 
on the statistic A(x*) = r x* — k h (Pq + t) + K n (/3o)> where the bootstrap 
is taken from the tilted empirical distribution -Fo( x ) — SILi ex P(A) x « ~~ 
K n (/?o))/{xj < x}/n. A saddlepoint approximation to this bootstrap is given, 
but it is noted that the relative errors of this approximation could not be 
proven from the theorem of that paper. The theorems of this section permit 
this proof. We use an analogous approach to give the likelihood ratio-like 
statistics for the two permutation test examples in the next section. 

Consider independent <i-dimensional random vectors Xi , . . . , X n , with the 
first do components X\j , . . . , Xd j confined to a lattice with unit spacings, 
for do < d, and with the average cumulant generating function 

n 

(1) k(t) = n~ x log(£e TTs ' 1 ) = n" 1 ^log(£e TTx <), 

i=\ 

where S n = Xi + • • • + X n . For some x we can define 

(2) A(x) = f T x - k(t ) 
for r satisfying 

(3) *'(f)=x 
and 

(4) r(x) = e- nA M(27m)- do/2 (2-K/n)- dl / 2 \V+\- 1/2 . 

In the case when the last d\ components of X 1; . . . ,X n have densities, this 
is the saddlepoint density approximation for X = S n /n, obtained in the case 
of identically distributed random vectors in [4]. In many cases when these 
last components lack a density, the theorem below will imply that their 
distribution may be well approximated by a continuous distribution. 



4 J. KOLASSA AND J. ROBINSON 

Let \i denote the distribution of X = S re /n, let 0* = {r : k(t) < 00} , and 
let fi T (dy) =exp(— 7i(k(t) — T T y))[i(dy) define the distribution of X T , the 
mean of Xi T , . . . ,X nT , the associated independent random vectors. These 
conjugate distributions, first introduced in [5], permit us to consider large 
deviations. Let V T = k"(t), and, taking ||x|| = (x x) 1 ' 2 , let 

n 

m {T) = n- 1 Y J E[\\V^ l2 {^T-E[K lT ])n 
*=l 

Let 

q T (T) =sup{|e^+^- K M| : \\vl /2 £\\ > (3/4)r ?3 (r)- 1 , 

(5) 

|£i| <7r for i <do,\€i\ <T,i >do}. 

We consider the following conditions, essentially from [11], where, through- 
out, c and C are generic positive constants, and \A\ denotes the determinant 
of a square matrix A. The complexity of these conditions is due to the fact 
that we need to consider conditional distributions of independent, but not 
identically distributed, random variables. 

• (Al) There is a compact subset, 0, of the interior of 0*, with in the 
interior of 0. 

• (A2) \V T \ >c>0 for rG0. 

• (A3) i]j(t) < C for j = 1, . . . , 5 and r € 0. 

• (A4) n 2d ^ +2 q T {n- 2 )<C. 

Here the first condition asserts that there is an open neighborhood of the 
origin where the cumulative generating function exists. The second condi- 
tion bounds the average variance of the associated random variables away 
from zero, and the third gives upper bounds the first 5 standardized mo- 
ments in this neighborhood. The fourth condition is a smoothness condition 
introduced first for the univariate case in [1] and which is sufficient to allow 
Edgeworth expansions for many statistics based on ranks and applications 
to bootstrap and permutation statistics when the original observations are 
from a continuous distribution. 

Let X = k'(0); then we are able to obtain equations (2), (3) and (4) 
for x G X. Also, if d§ > 0, let Ao(xo) = r xo — ko(^o) for to satisfying 
Kq(to) =xo, where the subscript denotes a reduction to the first do ele- 
ments of the d- vectors, and we will use the subscript 1 to denote the last d\ 
elements. If ro(xo) = (27rn) _d °' 2 |Vf | _1 ' 2 exp(— nAo(xo)), then from [4], we 
have P(Xo = xo) = /Uo(xo) = ro(xo)(l + 0(l/n)). For do > 0, we will consider 
x T = (x^x^) and replace r(x) by 

\y. |l/2 -n(A(x)-A (x )) 
I "to I e 



(6) r(xi|x ) =r(x)/r (x ) 



(27r/n) d i/ 2 |"^| 1 /2 
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and replace \i by the distribution of Xi conditional on Xo = xo , so that we 
consider conditional probabilities of Xi given Xo = xo , associated with sets 
of form 

J- = {x 1 :A(x)-A (x )>A,x T = (x r ,x7)}. 

The main result is the following theorem, whose proof is deferred to a later 
section. 



Theorem 1. Under conditions (Al)-(A4), 



(7) 



H(F> 



/ r(xi|x )(ixi = / r(x 1 |x )dx 1 



0(l/n) 



Note that if the nonlattice subvectors of Xi, . . . ,X n have densities, the 
variables are identically distributed and (Al) and (A2) hold, then the the- 
orem follows from Theorem 1 of [4]. 

The following theorem is a corollary whose derivation we include here. 
This is the form that will be used in examples. 



Theorem 2. Under the conditions of Theorem 1, ifu= v2A, 

(8) / r(xi|x ) dxi = Q dl (nu 2 )[l + 0(l/n)] + — u dl e 
JT n 



u 2 



JT 

and 

(9) J r(x 1 |x )dx 1 = Q dl (nn* 2 )[l + O(l/n)], 

where Q d (x) = P(x 2 d > x), 

(10) u* = u - Iog(G(u))/nu, 

n di/2 
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2*/2-ir(d!/2) ' 



(i 2 ) S( ^ s)= mm^a^i 

2^/2^1 - 2 |s T V 2 fi | 



(13) G(u)= / 5(u,s)ds 

Js dl 

for S dl the d\- dimensional unit sphere centered at zero, and where, for each 
s G S dl , r is chosen so A(xo,rs) — Ao(xo) = A and Vq = [k"(0) _1 ]h ; with 
the final subscripts denoting the lower right d\ x d\ submatrix. 

Proof. The derivation of (8), given Theorem 1, is given in [12]. To 
get (9), we use a related method. After making the transformations y = 

— 1/2 

V xi, y — > (r, s) and (r, s) — > (u,s), where the first is the polar transfor- 
mation with ||x|| = r and s G S d , the unit sphere in (i-dimensions, and the 
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second has u = y / 2(A(xo,rs) — Ao(xo)), we have 

r /'OO 

/ r(xi|x )dxi = c n / v d - 1 e- nv ,2 G{v)dv 

v d-l e -n(v-\o g G{v)/nv) 2 /2 ^ + (l/n)). 



Then make the transformation v* = v — log G(v)/nv. The final equality fol- 
lows since G(v) = 1 + v 2 k(v) and G'(v) = vk*(v), where k(v) and k*(v) are 
bounded as shown in [12]. □ 

Remark. The integral (13) can be approximated by a Monte Carlo 
method, for example, by approximating J s h(s) ds as 

27T d/2 1 M 

where U\, . . . ,Um are i.i.d. uniformly distributed on Sd- Here the number 
of replicates in the Monte Carlo simulation can be small with little loss of 
accuracy. We discuss this in the examples where it was found that M = 10 
was sufficient. It would be possible to use a method such as that in [7] to 
get a numerical approximation to the integral, but the Monte Carlo method 
is much simpler to use and easily gives the required accuracy. 

3. Two examples of permutation tests. We consider a k sample permu- 
tation test in a one-way design and a multivariate two-sample permutation 
test. In both cases we consider hypotheses that the populations of random 
variables or vectors are exchangeable. In the first case the observations are 
generated either by sampling m, . . . , n^ independent random variables from 
distributions iq , . . . , i*j. , and we test Hq : F\ = ■ • ■ = F^ , or they are gener- 
ated from an experiment in which k treatments are allocated at random to 
groups of sizes m,...,rik, and we test Hq\ treatments have equal effects. We 
choose a statistic suitable for testing with respect to differences in means. 
The standard choices of test statistic are the F-statistic from the analysis 
of variance or, for a nonparametric test based on ranks, the Kruskal-Wallis 
statistic. In the second case the observations are generated by sampling from 
two populations of /-dimensional random vectors, and we test for equality 
of the distributions, or they are generated by experimental randomization, 
and we test for equality of two treatments. Here the test statistic arising 
from an assumption of multivariate normality is the Mahalanobis D 2 test. 

3.1. Permutation tests for k samples. Suppose that ai,...,ajv are the 
elements of a finite population, such that J2m=i a m = and X^m=l a ? = ^ '• 
Let m, . . . ,rik be integers, such that N = J2i=i n i- Suppose that Ri, . . . , Rjy 
is an equiprobable random permutation of 1, . . . , N. Let X^ = an n + ... +n . +j , 

and let X t = Y^jLi ^ij/m. 
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For i = 1, . . . , k — 1, let ej have fe — 1 components, with component i equal 
to 1, and other components zero. Let I m , m = 1, . . . , N be independent and 
identically distributed random vectors with P(I TO = e^) = rii/N = pi for i < k 
and P(I m = 0) = n k /N = p k . Let S T = (£^ =1 *L Em=l <*ml£) = (S T , S[). 
We have 

P(nX < x) = P(Si < x|S = iVp), 

where X,x, p are k — 1 vectors corresponding to the first k — 1 samples. 
Under Hq, the cumulant generating function of S is 

N / fc-1 \ 

Nk(t ,Ti) = logEe^= l( ~ T o lm+T ^ Imam ^ = ^ logf p fc + ^pje 7 " ^ 1 ' " 71 ] . 

m=l V i=l / 

Let (t , r 1 ) be the solution of 

«'(to,ti) = (p,x). 

Let -B = {x: A(x) > u 2 /2}, where A(x) = r p + r 1 x — k(to,ti), and note 
that k'(0,0) = (p,0) and «(0,0) = 0. Now from Theorem 1, if g T (n~ 2 ) = 
0(n~ 2k ), 

P(A(X) > u 2 /2) = / r(x|p) dx(l + 0(1/N)), 
Jb 

where 

r(x|p) = (27r/^)-(^i)/2| K00 ( ,o)| 1 /2| K "(^ ^ 1 )|"i/2 e -JVA(x) > 

Then from Theorem 2, G(u) is given in (12) and (13) with do = d\ = k — 1. 
Now we can use (8) and (9) to get the two approximations. 

3.2. Numerical results for k-sample test. Consider first the rank test 
based on the statistic A(X) where oi = 1, . . . , oln = N, with N = 20 for 4 
groups of size 5; in the standard case the Kruskal-Wallis test would be used. 
The following table gives the results of tail probabilities from a Monte Carlo 
simulation of A(X) (MC A) and of the Kruskal-Wallis statistic (MC K-W) 
using 100,000 permutations, the chi-squared approximation (%|) and the 
saddlepoint approximations using (8) (SP LR A) and (9) (SP BN A), using 
M = 1000 Monte Carlo samples from S3. Inspection of the table compar- 
ing the saddlepoint Lugananni-Rice and Barndorff-Nielsen approximations 
with the Monte Carlo approximation for A shows the considerable accuracy 
of these approximations throughout the range. The chi square approxima- 
tions to the distribution of the Kruskal-Wallis statistic does not have this 
degree of accuracy. We note that good approximations for the saddlepoint 
approximations are achieved by M as small as 10. We obtained the standard 
deviation of individual random values of the integrand and noted that for 
Table 1 this was 0.003 for u = 0.6 and 0.0007 for u = 0.9, indicating that 
M = 10 gives sufficient accuracy in this example. 
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Table 1 
The ^-sample rank tests with m = 5 



u 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


MC A 


0.6758 


0.4328 


0.2365 


0.1087 


0.0423 


0.0142 


0.0041 


MC K-W 


0.6583 


0.4027 


0.1921 


0.0652 


0.0135 


0.0012 


0.0000 


x§ 


0.6149 


0.3618 


0.1718 


0.0658 


0.0203 


0.0051 


0.0010 


SP LR A 


0.6811 


0.4446 


0.2454 


0.1151 


0.0464 


0.0164 


0.0052 


SP BN A 


0.6753 


0.4380 


0.2387 


0.1101 


0.0434 


0.0148 


0.0045 



Also consider the permutation test based on a single sample of 40 in 4 
groups of 10 from an exponential distribution, comparing as above each of 
the saddlepoint approximations with the Monte Carlo approximations in 
this case and with the standard test based on the sum of squares from an 
analysis of variance. The same pattern of accuracy as reported above is 
apparent from inspection of Table 2. 

3.3. A two-sample multivariate permutation test. Let ai,...,ajy be l- 
vectors regarded as elements of a finite population such that £)t=i a^ = and 
Yli=i a * a f = NI. Let Ri,...,Rn be obtained by an equiprobable random 
permutation of 1, . . . , TV, let Xj = a^. , j = 1, . . . , N and let Xi = Y^j=i Xj/n 
for n = Np with <p < 1. Let I\, . . . , Jjy be i.i.d. Bernoulli variables with 
Eli =P- If S T = (So, SJ) with So = S»=i^i and Si = X^=i a «-^' tnen for 
any Borel set T , 

(14) P(%€T) = P(S 1 /N€T\S /N = p). 

Let r T = (to,tJ) with to G 5R and n € 3ft d and let 

k(t) = N^logEexp^oSo + rJS ± ) 

N 
= N~ 1 ^2log(q + pe T0+T ^ a >). 

i=\ 

Table 2 
The 4-sample permutation tests with exponentially distributed errors and m = 10 



u 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


MC A 


0.9456 


0.6837 


0.3434 


0.1160 


0.0275 


0.0043 


0.0004 


MC ANOV 


0.9455 


0.6784 


0.3273 


0.0971 


0.0164 


0.0015 


0.0004 


xi 


0.9402 


0.6594 


0.3080 


0.0937 


0.0186 


0.0024 


0.0002 


SP LR A 


0.9491 


0.6888 


0.3456 


0.1174 


0.0272 


0.0043 


0.0004 


SP BN A 


0.9486 


0.6877 


0.3441 


0.1164 


0.0268 


0.0042 


0.0004 
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Table 3 
The 3 -dimensional two sample parmutation test 



u 


0.3 


0.4 


0.5 


0.6 


0.7 


MC A 


0.3543 


0.1249 


0.0276 


0.0041 


0.0006 


xl 


0.3080 


0.0937 


0.0186 


0.0024 


0.0002 


SP LRA 


0.3528 


0.1207 


0.0282 


0.0045 


0.0005 


SP BN A 


0.3507 


0.1194 


0.0278 


0.0043 


0.0005 


Quadratic 


0.3325 


0.0939 


0.0135 


0.0004 


0.0001 



Let r be the solution of k'(t) = (p,x T ) T , and let A(p,x) = fop + tJx. — 
k(t). Consider sets J- = {x:A(p,x) > A}. Then from Theorem 2, we can 
approximate (14) by (8) or (9) where G(u) is given by (12) and (13) with 
do = 1 and d\ = l. 

3.4. Numerical results for two-sample test. Consider the test based on 
two samples of size 40 from a 3-variate exponential distribution with mean 1 
and covariance matrix /. After standardizing the combined sample we con- 
sider tests based on the statistic A(X) or XX T , equivalent to the usual 
normal theory based statistic. We calculate the tail probabilities based on 
Theorem 2 in this case and Monte Carlo approximations to the permutation 
tests based on 10,000 random permutations. Table 3 demonstrates the ac- 
curacy of the two saddlepoint approximations throughout the range. It also 
shows that the chi-squared approximation is not satisfactory either for A or 
for the classical quadratic form statistic. However, while we have accurate 
tail probability approximations for the new statistic, such approximations 
are not available for the classical quadratic form. 

4. Proofs of the main results. For notational convenience we will re- 
strict attention to the case do = 0, as details of the case conditional on 
lattice variables follow in a straightforward manner. The following theorem 
is a simplified version of Theorem 1 of [11], taking s = 5, do = and A as 
a d-dimensional cube in X with center a and side 5 = n . As in (1.10) 
of [11], let 

e 2 (y, fjL T ) = (1 + Qi(y*) + Q 2 (y*))(2vr/n)- d / 2 |y T r 1 / 2 e^ T ^/ 2 

with y* = n^^V-r (y — k'(t)), be the formal Edgeworth expansion of or- 
der 2 for X T = J2i=i Xj T /n, and let 

e2 (T,5,x-«(T)) = j e nTT ^e 2 (y,n T )dy. 

The terms Q\ and Q2 are given explicitly in (1.11) of [11], and are terms of 
order n -1 ' 2 and n _1 , respectively. 
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Theorem 3. For any set £ C A and e > 0, take £ £ = {z : 3y G £ , ||z 
y|| <e}. Choose eG (0, c/n 2 ), and let T = l/e. For xg£c X, 

\fi(£) - e- n(TTx - K(T)) e 2 (r,£,x - m(r))| < e-^^^^^R 
for 

R = C[Yol(£ 2e )( V5 (r)n~ 3 / 2 + \V T \^ 2 n^ 2 T d q T (T)) + Vol(£ 2£ - £. 2s )]. 

Note that this follows, since 



X-r,£ 2e (0)= / e" TT ( u - a )^<CVol(£: 2£ ) 

as £ C ^4 implies that ||u — a|| < c(5 + 2e) < cn _1 . 

We give a preliminary lemma before proceeding to the proof of Theorem 1, 
using the notation k(t(x)) = x and A(x) = r(x) T x — k(t(x)) for x£^, 

Lemma 1. For x.e£ c Ac X, 

(15) f r(y)dy - e n ^ T ^- T ^ T ^e 2 {r,£,0) = f r(y)dyO(l/n). 

Proof. Ignoring for the moment the terms involving Qi and Q 2 , the 
left-hand side in (15) is 



(16) J r(y) 



e n(A(y)-A(x)-T(x) T (y-x)-(y-x) T V;-i ) (y-x)/2) 



|F r(x) |V2/|^ (y) |l 



/2 



dy- 



Noting that ||y — x|| = 0(l/n), and using a Taylor series expansion about x, 
we see that the exponent in (16) is 0(l/n 2 ), and the denominator is 1 + 
0(l/n). So in (15) the first term on the left is as given by the expression 
on the right. Noting that Qi(0) = 0, we see that the term involving Q\ is of 
the same form. The proof is completed by noting that the term Q 2 is also 
of this form. □ 

Proof of Theorem 1. The proof will proceed by dividing X into small 
rectangles, applying Theorem 3 on each of these rectangles, and summing the 
results in a manner similar to that of [9]. For j G r L\, let A^ = {x G M : xi G 
((ji-d — ^)<5, (ji-d + ^)^]}> and let £J = Aj n T . By the intermediate value 
theorem, on each £ J , there is an Xj such that 

r(x)dx = r(x(j))Vol(£ j ). 
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Note that T = (Jj G j£ J an d £ J are disjoint. Define Tj so that Xj = k'(tj). 
Write J = {j : Vol(£J) > 0}. Then 

H(F) - j t(*) dx = Y^W*) - r(xj) Vol(£J)] 



^ jeJ 



where 



and 



— E\ + E 2 , 

i? 1 = ^[#)-r(x j )(27r/n) d / 2 |^ j | 1 /2 e2(f . )£: j i0)] 
jeJ 



^2 = "E / r-(y)dy-e" K ^)-^ e2 (fj,£:J,0) . 

Using Lemma 1 on each S i and summing, we have 

E 2 = [ r(y)dyO(l/n). 

Now consider E\ . Apply Theorem 3 to each £ J , and sum to get 

(17) \E 1 \<J2r( Xj )(R lj + R 2j ), 

JGJ 

where 

R Xi = cvoic^MTjjn- 1 + l^/zv/^cr)] 

and 

i? 2j = Voi(4-£ J l 2e ). 

The summation of these terms is complicated by the fact that the sets are 
not disjoint and not all are subsets of T . So introduce sets Hj = Aj n Re- 
consider the set W , the union of Wj and the 3 d — 1 sets formed by reflections 
of T~Lj in each of the lower-dimensional faces of .Aj. Then £j C T~L* so 

Vol(«£ , j 2£ )/Vol(^ j )<3 d . 

So 

^r(x j )U ]J <^r(x j )Vol(Wj)0(l/n)= / r(y)dyO(l/n). 
jeJ jGJ ■ / - F2s 

Also 

Vol(4 - £ J l 2e )/ Vol(Wj) < C £ /<5 = 0(l/n). 
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Using this to bound the second sum on the right-hand side of (17) and the 
previous bound for the first term gives 



\Ei\ 



r{y)dyO{l/n). 



T^ 



Note that for any x such that A(x) = A and any z G Tie-, 
A(z) > A(x) - | (z - x) T A'(x)| > A - Ce. 
So the theorem follows by noting that 



[3] 

[i] 
[5] 
[6] 

[7] 

[8] 

[9] 

[10] 

[11] 
[12] 



Tie 



r(y)dy= / r(y)dy(l + 0(l/n)). 



□ 



t 
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